PRIME CHAINS AND PRATT TREES 
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Abstract. Prime chains are sequences pi, . . . ,pk of primes for which Pj+i = 1 (mod pj) for 
each j. We introduce three new methods for counting long prime chains. The first is used to show 
that7V(a;;p) = 0^{x^^^), where N{x;p) is the number of chains withpi = p and ^ px. The 
second method is used to show that the number of prime chains ending at p is x logp for most 
p. The third method produces the first nontrivial upper bounds on H{p), the length of the longest 
chain with pk = p, valid for almost all p. As a consequence, we also settle a conjecture of Erdos, 
Granville, Pomerance and Spiro from 1990. A probabilistic model of H{p), based on the theory 
of branching random walks, is introduced and analyzed. The model suggests that for most p ^ x, 
H{p) stays very close to e log log x. 



1. INTRODUCTION 

1.1. For positive integers a and h, write a -< 6 if 6 = 1 (mod a). We are interested in properties 
of prime chains Pi -< P2 -<■■■-< Pk, e-g- 3 -< 7 -< 29 -< 59. Prime chains are multiplicative 
analogs of the well-studied additive prime fc-tuples (sequences pi < ■ ■ ■ < pk of primes with 
Pk — Pi small). Important quantities of study are N(x), the number of prime chains with pk ^ x 
(k variable), N{x; p), the number of prime chains with pi = p and Pk/pi ^ x, f{p), the number of 
prime chains with pk = p, and H{p), the length of the longest prime chain with pk = p. Estimates 
for these quantities have arisen in investigations of iterates of Euler's totient function 0(n) and 
Carmichael's function A(n) (e.g. IS, [El, Hill, [|H, m, |l29l, [l30J), the value distribution of \{n) 
||2T]| . common values of 0(n) and the sum-of -divisors function a{n) [|22l . and the complexity of 
primality certificates ([[8l, [|33ll ). 

In studying long chains, where the ratios logp^+i/ logpj are small on average, we require infor- 
mation about the large prime factors of shifted primes p — 1. That is, we require good estimates 
for tt{x; g, 1) when q is large, where 7r{x; q, a) = \{p ^ x : p = a (mod g)}|. Progress, however. 
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is hampered by our poor knowledge when g > ^Jx. Let = J^dt/logt. The Bombieri- 
Vinogradov theorem ( [|T6ll . Ch. 28) implies that 



(1.1) > max 



n{y;m, 1) - 



(j){m) 



with Q = x^^'^iXogx)^^ and R = 2;(logx)^'^ (here A > is arbitrary and B depends on A). 
The corresponding statement with Q = is not known for any fixed > 1/2, however it is 
conjectured (the EUiott-Halberstam conjecture; abbreviated EH) that ( |1.1[ ) holds with Q = and 
R = a;(loga;)"^ for any 9 < 1 and A > 0. The one-sided Brun-Titchmarsh inequality 

2x 

(1-2) vr(x; q, 1) < — - — -— -, 

[q - 1) log(x/g) 

however, is useful in some situations. 

If one asks just for the existence of many shifted primes p — 1 with a large prime factor, we can 
do a little bit better than Bombieri- Vinogradov. Let P^(n) denote the largest prime factor of n, 
and let 6*0 be the supremum of real numbers 9 so that there are infinitely many primes p ^ x such 
that P^{p — 1) ^ x^. EH implies 9q = 1, and the best unconditional result is due to Baker and 
Harman [4J, who showed that 6*0 ^ 0.677. 

In this paper, we prove new bounds for N(x;p), N(x), f{p) and H{p). At the core of our 
arguments is a kind of duality principle: in a chain pi -< ■ ■ ■ -< pk, there are integers rrij 
with = rrijPj + 1, and there is an obvious bijection between the fc-tuples (pi, . . . ,Pk) and 
(pi, mi, . . . , mfc-i). It is often more efficient to focus on properties of the latter vector rather than 
the former. 



1.2. We begin with the problem of bounding N{x;p). By iterating ( |1.2| ), one arrives at a uniform 
bound (e.g. [[T9ll , Theorem 3.5) 

(1.3) m,r.p) « £(£l2Iii^ 

logx 

for the number, Nk{x] p), of prime chains of length k with pi = p and Pk/pi ^ x. Here log;, x is 
the k-th iterate of the logarithm of x, and c is some constant. Summing ( |1.3[ ) over k ^ |^ + 1, 
one obtains the weak estimate N(x;p) <^ x^^^°^'i^\ 

Theorem 1. For p ^ 2 and x ^ 20, we have the effective estimate 

N f lOgxflOgo X + 0(1)) 1 , , 

I log2 X J 

In particular, for every e > there is an effective constant C {e) so that N{x; p) ^ C {e)x^~^^. 

Theorem [T] has applications to problems which, at first glance, have nothing to do with prime 
chains. First, it is a crucial tool in the recent proof by Ford, Luca and Pomerance fTT\ that the 
equation 0(a) = a{b) has infinitely many solutions, settling a well-known 50-year old prob- 
lem of Erdos. In llITI . Theorem [T] is used to show that for some effective qo, if 7r(p^";p", 1) — 
7r(p^"; 1) ^ 113p^^ I log(p"~'"^) for all prime powers p"" E (10^°, qo], then for every positive 
integer n, there is another positive integer m with \{n) = A(m). This nearly settles a conjecture 
from [5J, the analog for A of the famous Carmichael Conjecture for 0. 
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Theorem [T] is nearly best possible, since N{x;p) ^ N2{x;p) = Tc{px;p, 1), which is expected 
to be ^ x/{logpx) unless x is very small relative to p. 

Conjecture 1. We have N{x; p) <ti x. 

Conjecture [T] is easy to prove when p is bounded. Using /(2) = 1 and the recursive formula 

(1.4) /(p) = l+ ^(^)' 
we have 

(1.5) /(P)^^-I (allp). 

log 2 

Summing onp ^ x using the prime number theorem gives N{x) <^ x and hence N{x] p) <^ px. 
Lower bounds on f{p) and N{x) are more difficult, since f{p) is sometimes very small, e.g. if 
p = 1 + 2"3'' then f{p) =4 (it is conjectured that there are infinitely many such primes). 

Theorem 2. (i) We have f{p) ^ 0.378logp for almost all primes p. Hence, N{x) ^ x. 



(ii) For all X ^ 3 and any positive integer h, \{p ^ x : f (p) = h}\ ^ 



6 logx 
h 



h 



In particular, part (ii) implies that primes with f{p) = o(logp) are exceptionally rare, the count- 
ing function being x"*^^^ as x — t- oo. Also, by (i), we have N{x] 2) = ^N{x) ^ x. 

It is likely that / (p) has normal order ^ c log p for some c. A very similar problem was considered 
in Section 2 of [|T9ll , namely the behavior of I{n) = min{j : (pji^n) = 1}, where (pj is the j-th 
iterate of 0. It turns out that F(n) = I{n) — {1 " is completely additive, and ^(p) = F{p—1) = 



Sg^llp-i (^^il) for odd primes p. This is similar to ( |1.4[ ), the only difference being the behavior 
at proper prime powers, which play an insignificant role in the arguments in liT9ll . Summing ( |1.4| ) 
over primes p ^ x gives 

N{x) = 7r{x)+ J2 f{q)^{x;q,l). 

Inserting this relation into the proof of Theorem 2.1 in [fT9ll (combine Lemma 2.4, Corollary 2.5, 
(2.8) and Theorem 2.1 therein), we obtain the following. 

Theorem A. If ( [TTT] ) holds with Q = x^'^^"^^""^'^'" and R = x(loga;)~^ /or some fixed 5 > Q, 
then N{x) ^ cxfor some constant c > and f{p) has normal order clog p. 

Conjecture [T] implies that for all £ > and prime q > (logx)^^'^, for most p ^ x there is no 
prime chain g -<■■■-< p. This gives, conditionally, the first part of [fT9l Conjecture 1]. By contrast, 
the proof of Theorem 4.5 of [19] implies that if g ^ (log x)^, for some small constant c > 0, then 
for almost all primes p ^ x, there is a prime chain q -<■■■-< p. 



for every e > 0, \f{p) — c \ogp\ ^ £ logp for all primes but o['k{x)) exceptions up to x. 
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Pratt tree height, primes < 1,000,000,000 



Pratt tree height, random primes near 10'^40 
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Figure 1 . Pratt tree height 



1.3. The Pratt tree T{j>) for a prime p is the tree with root node p, below p are nodes labelled 
with the prime factors qofp— 1, below each q are nodes labelled with the prime factors of g — 1, 
and so on. In 1975, V. Pratt [.33| used it in conjunction with Lucas' primality test ( [,15,1 . §4.1) to 
show that every prime has a short certificate (proof of primality). Pomerance ||32ll gave another 
method for producing primality certificates, but it is an open problem whether the Pratt certificate 
has longer complexity for most primes (see §1 of ll32l ). Two important statistics of the Pratt tree 
are the total number of nodes f(p) and the height H{p), the latter being the length of the longest 
prime chain ending at p. It is known (see (71, [fTSl . [|20ll ) that the number of primes at a fixed level 
n in the Pratt tree for most p is ~ {log2 p)"'/n\. The idea is that for most primes p, p — 1 has a 
multiplicative structure similar to that of a typical integer of its size; namely, p — 1 has about log2 p 
prime factors, uniformly distributed on a log log-scale (see fl25ll . Ch. 1). This, however, does not 
give much information about H(p). 

Figure 1 shows histograms of H{p) for all primes p ^ 10^ and for 1000 randomly chosen primes 
near 10"^°. Very little is known about the distribution of H{p), the extremal behavior being a case 
in point. First, H{p) = 2 if and only if p is a Fermat prime, that is, p = 2^™ + 1 for some m. It 
seems plausible that H(p) = 3 for infinitely many p, but this is hopeless to prove at this time. At 
the other extreme, we have the trivial upper bound H{p) ^ |^ + 1. Large values of H{p) may be 
obtained using the special chain 2 = qi -< q2 ^ ■ ■ ■ , where, for each j, qj^i is the smallest prime 
= 1 (mod qj). By Linnik's theorem, g^+i ^ q^ for some constant L, hence H{qj) ^ 
It is conjectured that qj^i ^ QjO-Ogqj)'^ for some fixed C and this implies a far stronger bound 
H{qj) ^ logiogg • Even showing H{p) / \0g2P — > 00 for an infinite sequence of p is extremely 
hard, as it implies 6*0 = 1: a prime chain Pi -<■■■-< Pk = P with k = H{p) satisfies 



logPfc -p-r logpj+i / . logpj+i^ 
logpi logpj \l^j^k-l logpj ^ 



and hence 



(1.6) A :=limsup-^^ < ^ 



log2 p - log 6*0 
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On the other hand, Katai El proved that for some constant c > 0, H{p) ^ clog2P for almost all 
primes p (for all primes p ^ x with o{x/ \ogx) exceptions). We prove a version with the constant 
made explicit in terms of the level of distribution of primes in progressions. 

Theorem 3. (a) If ( |1.1| ) holds with Q = x^ and R = o(x/logx), then for any c < ^-ili^gO ' 
H{p) > c\og2P for almost all primes p; 

(b) If ( |1.1[ ) holds with Q = x^ and R = x{\ogx)~^ for every A > 1, then for every c < ^^^g , 

there is a K so that H{p) > cloggp/or ^ x/ (log x)^ primes p ^ x. Consequently, A ^ zr^tge- 
Corollary 1. EH implies that for every c < e, H{p) > c log2 pfor almost all p. 

Remark 1. By the Bombieri- Vinogradov theorem and ( |1.2| ), for every e > 0, ( |1.1| ) holds with 
Q = and R = Os{x/ logo;). 

If A ^ —1/ log 9 for some 9 < I, then there are many chains 2 = pi -<■■■-< p^ with '"^^^"^^ at 
most 1 /9 on average. Thus, although ( |1.6[ ) is likely an equality, to prove this would require strong 
information about the set of primes with P+(p — 1) near p^". The same difficulty arises when trying 
to prove (a) with 9 = 9q. 

The bound on H{p) in (a) is weaker than in (b), since it is unusual in a chain pi -< ■ ■ ■ -< Pk 
for most of the ratios '"^^^"^^ to be close to 1/9. The constant e^^ appearing in (a) is likely best 
possible; see Conjecture|2] below. 

Turning to upper bounds, before our work it was unknown if there is an infinite sequence of 
primes with H(p) = o(logp). A natural approach is to find p such that P^(p — 1) is small and 
use H(p) = 1 + maxq|(p_i) H{q) <^ maXg|(p_i) logg. However, our knowledge of smooth shifted 
primes is very weak (the world record is P^{p — 1) < pO-^sei infinitely often [|4j|). Using a new 
and very different approach, we give a much stronger upper bound. 

Theorem 4. Wehave H{p) ^ (log p)°-^^^^ for almost all p. 

The proof of Theorem|4]involves showing that for most primes p, all the primes at some bounded 
level of the tree are small. In particular, this settles [fT9l Conjecture 2]. 

Theorem 5. For every e > and 6 > 0, there is an integer k so that for large x and at least 
{1 — 5)x integers n ^ x, P~^{(j)k{n)) ^ x^. 

There is a folklore conjecture that H{p) = 0(log2p) for most p. 

Conjecture 2. H{p) has normal order e logg p- 

Remark 2. The lower bound in Conjecture [2] follows from EH (Corollary [T]). Conversely, by ( |1.6[ ), 
the lower bound in Conjecture [2] implies that 6^0 ^ e~^^^ = 0.6922 . . . (with a bit more work using 
( |1.2[ ), one can deduce 9q ^ 0.73). 

The upper bound in Conjecture [2] appears to be even more difficult. We cannot see a way to 
deduce it from standard conjectures in prime number theory, e.g. EH plus a uniform prime k- 
tuples conjecture, although Theorem|4]can be significantly improved under such hypotheses. 

Understanding H(p) requires detailed knowledge of the distribution of the large prime factors of 
shifted primes p — 1. Making a reasonable assumption for this distribution (a consequence of EH), 
in Section |6] we model the Pratt tree by a branching random walk. The model provides a much 
more precise version of Conjecture |2j 



6 



KEVIN FORD, SERGEI V. KONYAGIN AND FLORIAN LUCA 



Conjecture 3. H{j)) = e log2 P — § logs P + ^ip)' where for some fixed c,c' > and any z ^ 0, 
the number of p ^ x for which E{p) ^ z is ^ e~^'^7r(x) and <^ e^^^n{x), and E{p) ^ —z for 
0(exp{— e'^^}7r(a;)) primes ^ x. 

Notable features of Conjecture |3] are (i) the tightness of E{p): the distribution of H{p) over 
p ^ X does not widen as a; — )■ oo, and (ii) the pronounced asymmetry of the distribution of E{p). 
The analogs of these features for our probabilistic model are proved rigorously. 

Unconditionally, we show that H{p) is tight to the left of its median. 

Theorem 6. Suppose g and h are increasing, ^ g{x) ^ h{x), /i(x^) — h{x) ^ K and g{x'^) — 
g{x) ^ K for X ^ 1. Suppose, for large x, that H{p) ^ h{p) for at least en (x) primes ^ x. Then 
H(p) ^ h{p) — g{p)for all primes p ^ x with at most 0{tc{x) exp{ — ^^j^g{x)}) exceptions. 

We conclude this section with a conjecture about prime chains, which follows from the prime 
/c-tuples conjecture (with m = 2 below) but should be "easier". It is a multiplicative analog of 
the statement that the primes contain arbitrarily long arithmetic progressions, recently proved by 
Green and Tao [|23l . Even the case A; = 3 is not known. 

Conjecture 4. For each k ^ 3, there are infinitely many prime k-tuples {pi, ■ ■ ■ ,Pk) where, for 
some m, pj+i = mpj + Ifor 1 ^ j ^ k — 1. 

Notation. The letters p and q, with or without subscripts, always denote primes. Constants 
implied by the O— and <^ —symbols do not depend on any parameter unless indicated. In Section 
[6} we use P for probability and E for probabilistic expectation. 

2. Sifted chains: proof of Theorem [H 

The underlying idea is a sieve; relax the condition that the numbers in the chain are prime, and 
only require that they do not have small prime factors. Let y ^ 2 and let r be the product of the 
primes ^ y. For (a, r) = 1, let Gaix; y) be the number of chains rii ~< ■ ■ ■ ~< rik with rii = a, with 
nk/ni ^ X and consisting of numbers coprime to r. If p > y, then N(x;p) ^ Gp{x, y). There are 
integers ("links") mi, ... , mk-i with rij+i = rnjUj + 1 for 1 ^ j ^ /c — 1. For positive integers 
a, h and real s > 1, 

S{a,h) = S{a,h]r, s) = 

am,+l=6( mod r) 

encodes the possible links m from a number Ui = a (mod r) to a number rij+i = b (mod r). 

Fix r, s and let Ur = (Z/rZ)*. Let Ak{ai, a^) be the sum of (mi ■ ■ ■ m^-i)^'* over all tuples 
(mi, . . . , nik-i) which could serve as links in a chain starting from a number ni = ai (mod r), 
ending with a number Uk = ak (mod r) and with all numbers in the chain coprime to r. Then 

^2(01, 02) = S{ai, 02) and for A; ^ 3, 

Ak{ai,ak)= ^ ^(ai, 02)5(02, as) ■■■ 5'(afc_i,afc). 

a2,...,afc-ie(7r 

Let Vfc(ai) be the column vector {Ak{ai, at) ■ at G Uy). For consistency, let Vi{ai) be a vector 
with all zero entries except for an entry of 1 in the ai position. Since 
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we obtain VA,.+i(ai) = M\4(ai), where M = M{r,s) = (^S{a,b))^^^^ . The rows of M axe 
indexed by b and the columns are indexed by a. Finally, let Ffc(ai) = Afc(ai, a^), so that 

Ffc(ai) = (1, . . . , l)Vk{a^) = (1, . . . , 1)M'=- Vi(ai); 
i.e., Fk{ai) is the sum of the entries of column ai in M^~^. Since mi ■ ■ ■ ruk-i ^ nk/ui ^ x, 

(2.1) Ga{x;y)^M(x' Yl ^'^(«))- 



Observe that the sum on k in ( |2.1| ), if extended to /c = oo, is convergent if and only if M is a 
contracting matrix, i.e., all the eigenvalues of M have modulus < 1. Since M has positive real 
entries, the Perron-Frobenius Theorem implies that the eigenvalue with largest modulus is positive, 
real and simple. Call this eigenvalue \{s; y). 

We show below that if y is large and s ^ 1 + 7)^, then \{s;y) < 1. Accurate estimation 
of A(s; y) is difficult for large y, but the largest row sum of M serves as an upper bound. For a 
generic matrix A, let Rb{A) be the sum of the entries in the row indexed by b, and let R{A) be the 
maximum row sum of A. For row b of M, write d = (b — l,r) and b' = Then 

R^(M) = J2 = k-'#{a eUr:ak = b' (mod r/d)}. 

adUr am=6— l(mod r) (k,r/d)=l 

The congruence ak = b' (mod r/d) has a unique solution modulo r/d, and hence has 0((i) solu- 
tions a E Ur- Thus, 

Therefore, since d is always even. 



(2.2) ^(^) = ^73111(1 



p>y 

Since i?(A5) ^ R{A)R{B), R{M''-^) ^ R{M)''-\ To bound G„(x; y), we need to bound the 
largest column sum of M^~^. Lacking a better approach, we use the crude bound (j){r)R{M)^^^ . 
Thus, Fk{a) ^ (l){q)R{M^-^) ^ 0(r)i?(M)'^-i. By dTT] ), 



Gaix;y) ^ 0(r) inf 



s:R{M)<l 1 - i?(M)' 

By standard prime number estimates, if 1 < s ^ 2, then 



g-(s-l)logj/ 



-5^1og(l -p-^) = 0(l/y^-i) + J2p'^ « 7 • 

[s — I) logy 



p>y p>y 



Take y = ^ and s = 1 + ^g^. Since 2^ - 1 = 1 + (2 log2)(s - 1) + 0((s - l)^), ^ implies 

1 — R(M) ~ (2 log 2) (s — 1) as X —7- 00. Since 0(r) ^ r = e'^^^''^-'^))^ as x — i- 00, this proves 
Theorem [TJ 
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3. Proof OF Theorem [2] 

Let Q{p) be the multiset of prime labels appearing in T{p), and let T'{p) be the subtree of T(p) 
consisting of nodes with odd prime labels. There is a natural bijection between T(p) and T'(p), 
obtained by adding to every node in T'{p) a child node with label 2. Let l{n) = Y[p'^\\nP^^^ ■ The 
quantity -^l{q — 1) measures the "loss of mass" when descending from a node labelled q to its 
child nodes: in fact, it is easy to see that 



n 



^ l{q-l)]=p. 



1 

If p > 2, exactly half of the nodes in T(p) are labelled with 2, thus 
(3.1) W l{q-l) ^ j92-^/(p). 

Consider the set oi p ^ x with f{p) = h, where h = 2n is positive and even. Let T be the 
set of rooted trees on n nodes. For each T' E T, we consider separately the primes p with T'{p) 
tree-isomorphic to T'. Form the tree T on h nodes, by adding to each node of T' an additional 
child node. We count how many ways we can label with primes the nodes of T, with the leafs 
having label 2 and the root having label p ^ x. For a given prime p, there may be more than one 
way to assign primes in the Pratt tree to the nodes of T'; this occurs when some node has two or 
more child trees that are isomorphic (as rooted trees). Thus, for each T', we count ways to assign 
primes to the nodes, and divide by the number of ways in which we can permute the nodes; that 
is, the number I{T') of isomorphisms of T'. Assign to each node an ordinal number 1,2, . . . , h so 
that the children of node j are assigned lower ordinals (e.g., node 1 will be a lowest leaf, and node 
h will be the root). To node number j, we let qj be its prime label and Ij = l{qj — 1). Let Bj be 
the set of ordinal numbers of the children of node j, and observe that Bi, . . . ,3^ depend only on 
T' . With this notation. 



(3.2) - 1 = /, n 



qk- 



With T fixed, p.2| ) implies a natural bijection between (gi, . . . , g^) and (Zi, . . . , Ih) (recall that 



leafs have qj = Ij = 2). By p.l[ ), we have for any (3 > 



T'eT ^ ' li,...,lh ^ ^ 

Suppose that j ^ 2 and that /i, . . . , have been chosen. If node j is a leaf of T, then Ij = 1. 
Otherwise, using p.2[ ), the primes qk for k E Bj are determined by /i, . . . , Moreover, a prime 
r\lj must equal one of these primes q^. Hence 

E'7'< n (i-a-y « (i-2-^)-'(i-3-")*-™. 
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By Borchardt's formula^ lfT3l for counting labelled trees, 



E 



n 



n-2 



n 



n-l 



^^„^,^ I{T') (n-l)! 
Since X^iB^i^sid-^jl ^ 1) = — 1. we conclude that 

(3.3) \{p^x: f{p) = h}\ ^ {x2-''/^f (l - 2-^)""/' (l - 3-'^)""/' 

Taking (3 = 0.37, the right side of ^3} is ^ x°-3^(27.8371)^/2 ^ x^-^^^ for /i ^ 0.378 log x. For 
the second part of Theorem |2| assume h ^ (2/5) logx (if /i ^ 3 logx, there are no such p and for 
(2/5) log X <h^3\ogx, (»M^)^ > x). Take/3 = /i/loga;- Since < /3 ^ 2/5, 2'^-l ^ /31og2 
and 1-3-^ ^ 0.889/3. Hence, the right side of ([33]) is ^ x''(4.412//32)V2. 

We remark that numerical improvements are possible by refining the above analysis; e.g. using 
the fact that leafs of T' must be labelled with a Fermat prime. 

4. Lower bounds for H{p): proof of Theorem[3] 
We show part (b) first, as the proof is much easier. 

Proof of Theorem^(b). Let c' and 9' satisfy 9' > 1/3 and c < c' < _ J^g, < and define 

K by 89^ = 9 — 9'. Let Xq be large, depending on K, c, c', 9, 9' and put c" = c' log2(a;g). Let 
V = {p : H{p) ^ c'log2P — c"}. In particular, V contains all primes ^ Xq. We shall prove 



(4.1) 



Q{x) := \Vn{x/2,x] \ ^ 



X 



(logx)^ 

for X ^ xq, which implies the desired conclusion (since c' > c). By the prime number theorem 
and the fact that > 1, if xq is large enough then ( |4.1[ ) holds for xq ^ x ^ x^. Suppose y ^ x^ 
and ( |4.1[ ) holds for xq ^ x ^ y. Assume y < x ^ 2y and put I = V H {x^ , x^]. Suppose that 
x/2 < p ^ X, and that q\p — 1, where q E I. Then 

H{p) ^ 1 + H{q) ^ 1 + c' log2 q - c" 

^ 1 + c' log2 x + c' log 9' - c" > c' log2 p - c", 

so that p E V. For x/2 < p ^ x, p — 1 is divisible by at most two primes from J, hence 



X 



(log x; 



Since (4.1 1 holds for x < ?/ ^ x , the sum on g G / is 



E 



g(2^ 

2Jx^' 



^^') log X 



log 2 



- 1 



1 



(logx^ 



9-9' 



8 



^^^(logx)^-i (logx) 



K-\ ' 



Therefore, (4.1 ) holds. By induction on dyadic intervals, (4.1 1 holds for all x ^ xq. 



□ 



commonly known as Cayley's formula. 
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Remark. The same proof gives, assuming that holds with Q = a;i~^(^) and R{x) = 
x(loga;)~^^^^ where e{x) — t- and g(x) — > oo as ,t — t- oo, that H(p) ^ h{p) log2P for infinitely 
many p, where h{p) — )■ oo as p — )■ oo (the function h depending on the functions e,g). 

Proof of Theorem^(a). We proceed by induction as in part (b), but instead we iterate by many 
levels in the chain at once rather than one level at a time. Suppose that c < h < c' < l/(e^^ — 
log(6')). For some constant c" , described below, let V = {p : H(p) ^ c'logaP — c"}. We will 
show, for some 5 > 0, that 

(4.2) V(x) ■.= \{p^x -.peV}]^ 6-^. 

log a; 

Consequently, a positive proportion of primes p satisfy H{p) > h log2 p, and Theorem|3](a) follows 
from Theorem |6l 

By Stirling's formula, there is an integer k ^ 2 such that 1/c' > |(A;!)^/'^ — log(6'). Let a, (3 
satisfy e'''^''' < (3 < 9'' exp (— (A;!)^/'') and (3exp ((/c!)^/'') < a < 9''. Suppose that 6 is suffi- 
ciently small, depending only on the choice of c', 9, k, a, (3. Let xq be sufficiently large, depending 
on c', 9, k, a, (3, 5, and put c" = c' log2(xo). Observe that ( |4.2[ ) holds trivially for 2 ^ x ^ Xq, pro- 
vided 5 is small enough. Throughout this proof, constants implied by the O— and ^ —symbols 
may depend on c', 9, k, a, (3, but not on 5. 



Next, suppose that Y ^ xq and that inequality ( |4.2[ ) holds for 2 ^ x ^ F. Let S* be a subset of 
the primes in V n [F^, Y'^ ]. Let M{S) be the number of primes po G (Y, 2Y] so that there is a 
prime chain pk -< Pk-i -<■■■-< Po vvith pk G S. For such pq, we have 

H{po) ^ k + H{pk) ^ k + c log2 Pk - c" 

= c' log2(2r) - c" + c'logP + k + ( 1 > C log2 Po - c" 



IokY 



if Xq is large enough. We will show, for appropriate S, that 
(4.3) M{S) ^5 ^ 



loffF' 



which imphes P{2Y) ^ PiY) + M (5) ^ 25Y/ log(2r). By induction over dyadic intervals, ( |43] ) 
implies ( |4.2[ ), and hence Theorem |3] (a) 



To prove ( |4.3| ), we will consider chains satisfying not only pk G S, but also 
(4.4) Pj+i^p' (O^j^A;-!), pi^Y'. 

With (|44j), we can use ( fTT] ) to accurately count such chains. We have M{S) ^ Mi{S) - K'hi^S), 



where Mi{S) is the number of chains satisfying pk E S and ( |4.4[ ), and M2(S') is the number of 



pairs of distinct chains satisfying these conditions with the same Pq. We begin with 

^i(^) = E E • ■ ■ E(^(2r,pi, 1) - 7r(r,pi, i)") , 

Pk&Spk-i Pi ^ ^ 

where Pa; -<■■■-< Po and ( |4.4[ ) in the summations. By ( |1.1[ ) and induction on 1 ^ j ^ A;, 



(4.5) M,(S)-— +0^— 
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Here, we used that for each pj, there are 0(1) chains Pk -<■■■-< Pj with ^ F^. By ( |4.5| ) with 

3 = k, 

^ ^ ^' \ogY Jy, {k-l)\t\ogt \\ogY ) 

5Y {\og{a/P)f f Y 
logF k\ \logY 

By hypothesis, \og{a/ (3) > (fc!)^/'^. The summands in ( |4.5[ ) (with j = k) are x l/pk for ^ 
Pa: ^ Hence, if 5 is small enough, there is a set S C P fi [F^, F"] such that 



(4.6) Y. — ^^^ ^i^*^) ^ ('^ + '^'^') 



We have M2 = M2,o + ■ ■ ■ + M2,fc_i where M2J counts pairs of coupled chains 

Pk^ < Pj+i 

jPj-< < Po 

P'k^ < P'j+i 



with each of the two chains satisfying ( |4.4[ ), pj+i ^ p'^^^ and Pk.p'k ^ "S*. We further write 
= M'^ j + M'^ -, where M!^ - counts pairs of such chains withp^ ^ pjj^ip'-^^Y^^ . As before, for 

each pair (p^+i, p'j+i), there are 0(1) choices for Pk.p'k, ■ ■ ■ ,Pj+2,p'j+2- For ^2,0' PiP'i ^ Y^'^', 
and for eachpo^ there are 0(1) choices for By sieve methods (e.g. Theorem 2.2 of [24]), 

^2,0 < \{n^Y -.71 = 1 {mod k),p-{n{^))>Y^}\ 

^ ^ , 0(A;)log2F ^ 

Here, P^{m) is the smallest prime factor of m. For j ^ 1, an argument similar to that leading to 
( |4.5| ), followed by the same sieve bound, gives 



log Y ^ ^ Pj ^ ^ n log Y 

Pj fc^y* n^y,n=l (mod fc) 

P-(n(2i^)>y'3 

For chains counted by Ml^j, the Brun-Titchmarsh inequality suffices for the estimations. When 
j ^ 1 and Pj is given, as before we have 

5^---5^7r(2y,pi,l)«— 
^-^ ^-^ Pi log r 

Pj-i pi ' J ^ 



By partial summation and ( |1.2| ), given pj+i and Pj+i, 



<^ 1 ^ log(l/^) <^ log(l/^) 

p^. Pj Pj+iP'j+i^^'^ogY Pj+ip'j+i Pj+iP'j+i 
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For j + l^r^A; — 1, 
(4.7) 



Pr 



< . 

Pr Pr+l 



^ 1 1 



Finally, by ( |4.6[ ), we arrive at 
(4.8) 



Y 



logy 



In a similar way, when j = 0, we have by ( |1.2[ ) and partial summation, 

log(l/5) 



7r(2F,M,l)« 



piPi 



P2P2 



A second application of (4.7 1 then gives (4.8 1 in this case 



Finally, combining our estimates for M2 and M2 j, we obtain M2{S) <^ 6 \og{l/ 6)Y/ \ogY. 



Together with (4.6), if 5 is small enough then (4.3 1 holds, and this completes the proof. 



□ 



5. Proof OF Theorems m AND [5] 

The proofs of Theorems |4] and |5] rely on the fact that the largest prime factor of p — 1 cannot be 
too large too often. At the core is a sieve upper bound for A;-tuples of primes which is uniform in 
k, and careful averages of the associated singular series. There is a potentially troublesome factor 
2''k\ in the sieve estimate, which is partly overcome by observing that if H{p) is large, then there 
must be a prime chain in the Pratt tree for p which is very condensed in a multiplicative sense. 



Lemma 5.1. There is a positive constant S so that the following holds. Let ai, . . . ,ak be positive 
integers, let bi, . . . ,bk be integers with {aj ,bj) = 1 for all j, and letC, (p) be the number of solutions 
h) = (mod p). Ifx ^ 10, 1 < A; ^ 6^ and 



B := logj9 ^ 6 logx, 



p 



then the number of integers n ^ xfor which ain + bi 



2^k\ 
(logx) 



-x& ■ exp I O 



kB + k"^ log2 X 
logx 



6 



Ofcn + bk are all prime and > k is 

n 



p 



1 



Proof. Since ^{p) = k for large p, (5 > if and only if ^{p) < p for all p. Also, ^(p) ^ k for all 
p. Hence, if (5 = 0, the number of n is zero. If (5 > 0, Montgomery's large sieve estimate [fT2l 
Theoreme 6] implies that the number of n in question is ^ x/G{^/x), where 



p\n 
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and n is the Mobius function. For fixed k, the argument in [[24l §5.3] gives G{z) ~ (log z)^ / {k\&). 
We sketch how to make explicit the dependence on k. By the argument in [[24l p. 147-148], 

p>z//i 

= Yl aid) log ^ + O (G(^) (5 + k log2 z)) . 

Adding the sum on the right side to both sides yields 

G{z)\ogz = {k + l) [ ^ dt + r{z)G{z) log z, 

where r{z) -C ^^iJ^^^ ^ • If ^ is small enough and z ^ ^/x, then \r{z)\ ^ |. By the argument in 



p. 150], for some constant D and for z ^ ^/x 



M , ^^G{z) ^ f ^ f kB + k'^log^z 



log z [ \ logz 

By the argument on [|24l p. 151-152], D^^ = k\G. Taking z = y/x completes the proof. □ 



For given integers mi, . . . ,mfc_i ^ 2, we will apply Lemma 5.1 with the forms fi{n) = n, 
fj+i{n) = nijfjin) + 1 (1 ^ j ^ A; — 1). We have fj{n) = ajn + bj, where 

i-i 

(5.1) ttj = mi--- mj_i (j ^ 1), bi = 0, bj = l + ^rrii - ■ ■ nij-i (j ^ 2). 

1=2 

Clearly, (aj, 6^) = 1. Let ©(m) = © be the associated singular series and let ^(p, m) = i{p). 
Lemma 5.2. There is a positive constant ci so that ©(m) <^ (ci log2(4mi ■ ■ ■ nik^i))''^^. Also, 

V ^"^^^'""^ logp ^ A; (log2(4mi ■ ■ ■ m,^,) + 0(1)) . 
p 

Proof. We have ^ (p, m) = kifp\N, where N = rrii ■ ■ ■ mk-i Y[i<j I '^i^j ~ '^j^i I ■ Also, ^ (p, m) ^ 
1 for all p. Let x = mi - ■ ■ nik_i ^ 2^^^^. By ( |5.1| ), ^ x and 6j ^ 1 + X]j=i ^/^'^ ^ ^ ^^'-h 
J. Thus, iV ^ ^ exp{0(log^x)}. Since l-k/p^{l- Ijpf forp > A;, if ©(m) > 

then ©(m) ^ Hpiwll ~ 1/p)^^^ ^ [c\ logg A^)^~^ for a constant Ci. The second bound follows 
from Ej,|^l/p^log2iV + 0(1). □ 

Lemma 5.3. Let k ^ 1 andZ C {1, . . . , k—1}. If the variables rrii are fixed (1 ^ z ^ k—1, i ^ X), 
then for any prime p, 

e(p, (mi, . . . , m,_i)) ^ - (p - 1)1^1+^ 

Proof Fix p and let N{k,X) be the sum on the left side. We use induction on k, the case 
k = 1 being trivial. Suppose k ^ 2 and the lemma holds with k replaced by A; — 1. If A; — 
1^1, then ^(p, (mi, . . . ,mfc_i)) ^ ^(p, (mi, . . . , m^.a)) implies N{k,X) ^ iV(A; - 1,X). 
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If k — 1 e X, then N{k,X) counts the number of (|X| + 1)— tuples (mi(i e 1),n) modulo p 
with p|/i(n) • • • /fc_i(n)(mfc-i/fc_i(n) + 1). The number of tuples with • ■ • fk-iiji) is 

pN{k — 1,1 — {k — 1}) and the number of remaining tuples is p'-^' — N{k — l,X—{k — 1}). By the 
inductive hypothesis, N{k, I) = pl^l + (p - l)N{k - 1, X - {/c - 1}) ^ pl^l+i - (p - □ 

Lemma 5.4. Let k ^ 4, an J suppose that Mi, Ni are integers satisfying Mj ^ 2 and 2 ^ Ni ^ 
2kMifor 1 ^ i ^ k — 1. For some positive constant C2, we have 

J2 ©(m) « Ml • • • M,_, (C2 log kf exp \o C^^) ] , 

Ni<mi^Ni+Mi ^ V ^Og/v / J 

where b is the number of variables Mj which are ^ 2^^ ^. 

Proo/ Let L = [log A;J + 1 and r = k'^L. We will perform a precise averaging of the factors in 

©(m) for primes p ^ r, and use crude estimates for larger p. If p \ mi ■ ■ ■ rrik-i, each congruence 
fj{n) = (mod p) has exactly one solution. For h > j, fj{n) = (mod p) and /^(n) = 
(mod p) have a common solution if and only if p\ {ajbh — ahbj). Write 

h-l 

(5.2) ajbh - ahbj = mi • • • mj_ig'j-,fe(m), 5'j>(m) 1 + ^ • • • m,j_i. 



Define 



We then have 



Let 



p|n p>r,p|gj.ft(m) 

ptmi-'-m/c-i 



n 0-^)0 n 

p>r 



J = {{j,h):l^j<h^k-l,h^j + 2, max Mi > 2''' , 



and put J = ^ (k-3Kk-2) _ ^^^^ letX = {i : > 2^'i°g''^}. Write X = Ua=i([^a, ^1] ^ N), 
where i^+i ^ + 2 for each a. For each a and 2 + ^ /i ^ i^+i (so h ^T), 
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Since Gh ^ Q^2^^^°^'^ , Gh has 0{k^ log^ k) prime factors, and thus for some constant C > 1, 
tpriGh) ^ ^w{J2p\Gh p>r ^ There are b such numbers h. Hence, by Holder's inequality. 



^6(m)<C'J]n(l 



(5.3) 



m p^r 

<c'(Efn(i 



P 



1 - 



P 



1 - 



-k k-l 



2L{fc-l) 



1=1 



ki ^\ 1 



X 



n E^^w^'^' 



i=l \ m 



2JL 



If we write ip^^ = 1 * (3s, then (3s is multiplicative and supported on square-free integers composed 
of primes > r. Furthermore, if p > r ^ s + 1, then 



(5.4) 

Thus, for each i. 



Up) 



p 



p — 1 



1 < Q^/iP-^) _ 1 < 



4s 
P ' 



J2 ^ Yl P^k-iAd) 



(5.5) 



Ni<mi^Ni+Mi 



d 



d^Ni+Mi 

^ {2k + 1)M, n (l + « A:M,. 

p>r ^ P J 



For fixed (j, /i) G J', let M; = max(Mj+i, . . . , M^.i) > 2^' ^ and write 
(5.6) 9j,h{™) = mim+i ■ ■ ■ mh^i)gj^i{m) + fi';,fc(m). 

We'll use 

G,,,(m)|G';.,(m) := J] 

P>»'.p|S:i,h(m) 
ptmi--mfe_ig;,h(m) 

and note that G)^^{m) ^ ^j>(m) ^ k{2k + l)''Mj+i . . . Mh-i ^ {QkMif by (|5]2]). Fix all of 
mj+i, . . . , m/i_i except for m/. By ( |5.4[ ) and ( |5.6| ), 



(5.7) 



2JL 



nil 



/32JL(ci) 



rf|G' (m) 



p>r 



1 



8JL 

p2 



n 

r<p^(6kMi)'' 



8JL 
P 
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Also, 



P 



P 



k 

' L-1 



exp < O 



k log2 k 
log k 



Therefore, by p3h, p5} and (5.7), 



^6(m) < (Mi---Mfc_i)i5^-Eexp|o 



k log2 k 
log k 



where 



m p^r 



P 



P 



Let M' = Ylp^^p. Since M' ^ e^'', for each i e X, Mj > A;M'. Hence, the number of G 
(iVj, + Mi] lying in a given residue class modulo M' is ^ Mi/M' + 1^(1 + 0{l/k))Mi/M' . 
Thus, by Lemma [53] and the Chinese Remainder Theorem, 



Ni<mi!iNi+Mi ieX 



rdi mod M' P^r 
k 



p\i\ yp — \ 



P 



PI 



m 



0^r?ii<p 

(iex) 



< Ml ■ ■ ■ Mfc_i JJ 

p^r 



P 



fc-l-|X| 



Noting that 6 = — 1 — |X|, the lemma follows from Mertens' estimate. 



□ 



Theorem 7. Suppose that r] > 0, r ^ 1, and I and x are sufficiently large as a function ofr]. There 
are 



«-^(2r/ei+'')'/2 + Vx(log2x) 
log a; ^-^ 



oiji) 



3+»7\ LiV(iog2iO J 



logx 



primes p ^ x, such that there is a prime chain p^i -< Pri-i ^ ■ ■ ■ -< Po = P with p^i > x^'^'^^^ 

Proof Suppose < 1 and rl ^ {\ogxy^^'^^'^\ else the theorem is trivial. Put kj = jl and 

Xj = x^^'^^^ ' for ^ j ^ r. Suppose p ^ x and there are even integers hi, ... , hk^ so that 

(5.8) p = pq = hipi + 1, = h2P2 + !,•••, Pkr-l = hkrPkr + 1, 

with po,. . . ,Pkr prime and pk^ ^ Xr- The vector {hi, ... , hk,. ) may not be unique, but we associate 
to each such p a single such vector. Each p lies in Qi U ■■■ U Qr, where Qj is the set of primes p 
so thatpfc^ < Xi {i < j) andpk^ ^ xj. By assumption, kr ^ {logXrY^^. 
Fix j and even integers hi, ... , hkj satisfying hi - ■ ■ h^. ^ x/xj. By Lemma 5.2, 



^kj-i{p-{hk^,...,hi)) , 
y ^ logp < kj log2 X < (logXr) ' log2 



X. 



p 
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By Lemma [5T| and Stirling's formula, the number of p = po ^ x satisfying ( |5.8| ) is 



(5.9) 



hi---hk 



{\ogXj 



Let 1 ^hj ^ kj be a parameter to be chosen later, and put Aj = 2^^^ . Let Qj i be the set of 
p E Qj for which at least bj of the variables hi, ... , h^. are ^ Aj, and (5j,2 = Qj\Qj,i- 

To estimate fix a set C {1, . . . , fc^} of size bj so that hi ^ for each i E B. Let 

X = {1 ^ z ^ fcj : z i3} and, for ^ i ^ j - 1, put = \B n {ki + 1, . . . , fci+i}], 
Ij = X n {/cj + 1, . . . , /cj+i}. By the definition of Qj, 



(5.10) 



Yl hg^hk^ + l---hk^ ^ ^ 



Since hi ^ 2 for all i. 



(5.11) 



ha 



Let a = l/\ogXj. By ( |5.10| ) and the elementary estimate Yli2iihiiy ^ ^ ^ V'^' 



i-i 



=0 ^ •'^ /i9>2 (geli) 



i-1 / \ a 



1=0 



J + 1 

i + 1 



The last sum is ^ + 1) ^ 2j. Also, l"o2''i ■ . . ^ j^'j and j! ^ (j/ey. Hence, 



(5.12) 



log Xj ^ 
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The number of choices for B is (Jp ^ (ekj/bj)''^ . By (jS^, §JT}, (|5^, and Lemma 5.2 



\QiA < 



(log Xj 



(5.13) 



X 



kf^ (cie^ log2 x) 



log Xj 



fcj+3/2 



/ 2efcf(fc,/&,)log^fc,(j + l) ^' 
I logx 



•C X exp|o(A;j logs ^) + 



(3 + ri) log fcj + log ( ^ ) - log2 X 



We next estimate Place each variable hi into an interval J,. If hi ^ Aj, then take 

Ji = (2'^~^, 2'^] for an integer k ^ 1, and if hi > Aj, then take 

j, = {[A,{i + i/k,y^-'\,[A,{i + i/k,y^\] 

for some integer k ^ 1. For brevity, write Jj = (Hi,Ki] for each i. Since Ki — Hi ^ Hi/{2kj), 
there are at most 6j values of i with Ki — Hi ^ Aj. Lemma [54| then gives 



^ ^ &{hk^,. . . ,ki) ^ 



All ■ • ■ /il. i/l ■ • ■ Hk ^ 



•C (calog/cj )^^ exp <^ O 



fej log2 kj 
log % 



n 



i=l 



By our definition of the intervals Ji, 



T^Ki-H, 



i=l 



H 



n 2 E n ^ 



hi---hk 



Thus, after summing over all possibilities for Ji, . . . , J^^, we obtain by ( |5.9[ ) 

' k^ logo fc,- 



(logXj)''j + ^ 



log A;, +^^^°S2A^. 



where ■ ■ ■ h'f^. ^ 2^^ ^^fc,+i ■ ■ ■ /^fej ^ 2^^Xi/xj for ^ i ^ j — 1. For positive ao, . . . , aj-i. 
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If we ignore the factors 2^^^"% the optimal choice of parameters is 

(i + 2)''(i + 1)'^ (i + l)''^'' 



(j + 1)^ logXj [{i + 2)'^ + 1)'' {i + l)*? - i^J ' 
Since {i + 2)'' - {i + 1)'' ^ r]{i + 2)''-\ 

.rj{i + l)*? logXj ' ?7(j + 1)^ \ogXj 
Recalling kj = jl, the sum on /i'^, . . . , /i'^ is at most 



^ = 0,...,j-l. 



ao + • • • + tti £ 



2fcj(«oH 



exp 



i=0 



J + 1 
i + 1 



log Xj 



77(j + l)^logXj 



The exponential factor is = e'-' = e'^^ and (j + 1)! ^ e ^(j + 1)-'^^, so 



y — ^ 



Therefore, 
(5.14) 



+ log, k, 

log kj 



Finally, put hj = [kj/ (log2 kj^l , and sum the inequalities ( |5.13[ ) and ( |5.14| ) for 1 ^ j ^ r. □ 

Proof of Theorem^ Let = 0.15718, / = [(loga;)^J and r = [(logx)^J , where e and (3 are fixed 
and satisfy < e + (3 < Then log Xr x (log For the primes j9 not counted in Theorem 

^ the primes at level rl of the Pratt tree are all < Xr, so H{p) ^ + 1 + r/ < (logx)°-^^°^^ 
if we take (3 sufficiently close to By Theorem |7| the number of exceptional primes p ^ x is 
0(xexp{ — (logx)''}) for some 5 > 0. □ 

Proof of Theorem^ Let x be large, x/ logx < n ^ x and suppose there is a prime p > x^/^ such 
that Then either (i) there is a prime q > x'^l'^ and ^ j ^ A; such that g^|0j(n), or (ii) 

there is a prime chain p = Pk ~< Pk-i -<■■■-< Pi -< Po with po\n. In case (i), let j be the smallest 
such index. Using the uniform estimate 

logaX 



p=l (mod m) 



p 0(m) 



coming from the Brun-Titchmarsh inequality, the number of integers in category (i) is 



E 



q>x' 



.e/2 



i=l x~/'^<q^x pj_i=l (mod (7^) pj_2 = l (mod pj_i) po=l (mod pi) 
Pj_is£a; pj_2s£x posSa; 



X 



X 



l-e/2 



logx 



i=i 



logx 
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Consider n in category (ii). Take r] = |, let r be the smallest integer with (r + 1) < e/2, let / be 
sufficiently large, / ^ logg x and k = rl. By Theorem|7} for x^^'^ < y ^ x, the number ofpo^y is 
0(y/ log^ 1/ + l/(2?7e^+'')^'/^/ logy). By partial summation, the number of n is x{2rie^'^'^)~''^'^. 
Taking I large enough, depending on e and 5, completes the proof. □ 

6. Stochastic model of Pratt trees 

In this section, we develop a model of the Pratt trees which explains Conjectures [2]and[3j Factor 

n as n = Hjli'' Pji.'^)^ with piiji) ^ P2{n) ^ ■ ■ ■ . Put Pj{n) = 1 for j > Q,{n) and let 

_ f \ogpi{n) \ogp2{n) 
\ log n log n 

The distribution of the first component of S{n) has been greatly studied, the results having wide 
application in the theory of numbers (see e.g. the comprehensive survey article ll26l ). We have^ 
P(log]9i(n) ^ ^ logn) = p{u), where p is the Dickman function, the unique continuous solution 
of the differential-delay equations p{u) = 1 (0 ^ m ^ 1), up'{u) = —p{u — 1) (m > 1). The com- 
plete distribution of S{n), found by Billingsly in 1972 [[TT|. corresponds to the Poisson-Dirichlet 
distribution with parameter 1, PD(1) for short (more precisely, for each j, the first j components 
of S{n) are distributed as the first j components of the PD(1) distribution). The joint distribution 
of the components in the PD{\) distribution can easily be expressed in terms of p. There is a 
simpler characterization of the distribution, found by Donnelly and Grimmett [17J. Let f/i, f/2, . . . 
be independent random variables with uniform distribution on [0, 1]. Let x = (xi, X2, . . .) be the 
infinite dimensional vector formed from the decreasing rearrangement of the numbers 

(6.1) |/l=f/l, Z/2 = (1 - f/l)f/2, y3 = (l-f/l)(l-f/2)f/3,.... 

Then x has the PD{1) distribution. The paper IfTTl gives a simple, transparent proof that (xi, . . . , Xk) 
and the first k components of S{n) have the same distribution. 

Since ^ Xj = 1 with probability 1 , we can interpret the PD ( 1 ) distribution as a random partition 
of the unit interval [0, 1] into an infinite number of parts achieved by cutting [0, 1] at a random place 
(with uniform distribution), then cutting the right sub-interval at a random place, and so on. 

Conjecture 5. As p runs over the set of primes, S{jp — 1) has PD{1) distribution. 

Conjecture |5] is widely believed, and is a simple consequence of EH. Unconditionally, we know 
little about primes in progressions to very large moduli. Assume that S{p — 1) has PD{1) distribu- 
tion, S{q — 1) has PD{1) distribution for each prime q\{p — 1), the vectors S{q — 1) for q\{p — 1) 
are independent, and so forth. The primes in the first level of the tree, on a logarithmic scale, 
correspond to a random partition of [0, 1]. The primes in the second level correspond to randomly 
partitioning each of the parts of the original partition, etc. The entire procedure corresponds to 
what is known as a discrete-time random fragmentation process. Random fragmentation processes 
have been used to model a variety of physical phenomena (e.g., genetic mutations, planet forma- 
tion) and the growth of certain data structures in computer science. Discrete time fragmentation 
processes may be recast in the language of branching random walks, which we now describe. 



%B C C N, we say P(n (^B\nC,A)^a if lim ^ ^ ■ " ^ 41 
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Let Mn be the size of the largest object at time n. Then M„ is a model of Qn ■= where g„ 
is the largest prime at level n of the tree. The event {Mn < |^} is a model of the statement "all 
the primes at level n of the Pratt tree for p are < 2"; that is, Hip) < n. Thus, H{p) is modeled by 
the random variable where T(£:) = min{n : M„ ^ e}. 

Assuming EH, Lamzouri [28] showed that Qn has the same distribution as M„ for each fixed n 
(he studies the distribution of P+(0„(m)) for all integers m; the same proofs give the distribution 
of Qn)- Further, on EH, Lamzouri shows that P{(5n ^ = P{^^n ^ = Pn{u), where, for 
each fixed n. 



(6.2) pn{u) = ( , , ) -> oo; 



with logQ(u) = u. Our goal is to understand the distribution of M„ as n — )■ oo. 

Create a tree structure from the random fragmentation process as follows: label the root node 
with zero, beneath the root node put an infinite number of child nodes, each corresponding to 
one of the fragments of the initial segment [0, 1]. Each of these nodes has an infinite number of 
child nodes, corresponding to the fragments in the second step of the process, and so on. Each 
node is labeled with the number — log x, where x is the fragment size. This randomly labeled 
tree corresponds to a branching random walk (BRW). More generally, an initial ancestor is at 
the origin, and who forms the zeroth generation. This parent then produces children, the first 
generation, which are randomly displaced from the parent according to some law. Each of these 
children behaves like an independent copy of the parent, their children randomly displaced from 
their parent according to the same law, and forming the second generation, and so on. In our case, 
each parent produces an infinite number of offspring, the displacements from their parent given by 
V = {—logy : y E Z}, where Z is a point set with PD{\) distribution. We'll say that V has 
LPD (logarithmic Poisson-Dirichlet) distribution from now on. 

Let Bn be the minimum label of an individual at time n, so that Bn = — log M„. The first order 
behavior of the analog of Bn (law of large numbers) for a general BRW was determined in the 
1970s by Biggins, Hammersley and Kingman (see \Ql). In our case. Biggins' theorem [|9| implies 

i?„ ~ ^ as n — )■ oo almost surely. Thus, T ^j^j ~ elog2P as p — )■ oo almost surely, which 

justifies Conjecture [2j 

Let hn = median(_B„). The study of Bn naturally breaks into two parts: (i) global behavior: 
asymptotics for 6„, and (ii) local behavior: the distribution of Bn — hn- A result of McDiarmid BH 
can be used to prove = f + 0(log n), and this was sharpened by Addario-Berry and Ford ([U to 

Theorem 8. We have 6„ = f + ^ logn + 0(1). 

Corollary 2. We have median{T{e)) = elog(l/£) - | log2(l/£:) + 0(1). 

This justifies part of Conjecture [3| One ingredient in the proof is the following expectation 
identity. Let Zn{t) be the number of generation n individuals with position ^ t, and let z^''^ be the 



set of positions of generation n individuals. If v = {vi, V2, - - -) has PD{1) distribution, ( |6.1| ) gives 

oo oo oo _ _ 

E E -I = E - ^0 ■ ■ ■ (1 - u,.,)u,y = J2 (TT^ = -: 
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since EU^ = E(l — UiY = 1/(1 + s). By the branching property, 

E e-"^" = E e-"^"-iE ^ e'^^^ = -E ^ e'^^'^-K 

By induction, the left side is 1/s", so e-'^dEZn{t) = 1/s". Therefore, EZn{t) = f'/n\. 
Because /n\ 1 whent = ^ + ^ log ?t, + 0(1), a naive guess would be 6„ = ^ + ^ logn + 0(l). 
However, for reasons clearly explained in [|2l and executed in |[T|, the leftmost point in the n-th 
generation of a branching random walk has an atypical ancestry with high probability. Denote the 
locations of points in the ancestral line of this leftmost point by 0, zi, ^2, • • • , -Zn = Bn with Bn 
close to hn- Then usually Zj ^ — 0(1) (1 ^ j ^ n/2). A randomly chosen point Zj G z^"^ 
has this property with probability of order 1/n, so the expected number of such Zj is in fact of 
order t"/ {n ■ n!), which is ^ 1 when ^ ^ f + |; log?^ + 0(1). 

We next discuss the local behavior of Under very general conditions on the BRW, it is 
known that Bn — is a tight sequence."^ The basic idea is that a single individual will, with high 
probability, produce many offspring a few generations later which are close by. In our situation, 
tightness on the left for H{p) is relatively easy to prove unconditionally: 

Proof of Theorem^ The conclusion is trivial if g{x^^'^) ^ SK, so we will assume that giyX^/"^) > 
3K. Let m = [gi^x^^"^) / K \ so that m ^ 3. Put Q = x"^ and let T be the set of primes 
a;i/2 < p ^ x such that there is a prime q\{p—l) with Q < q ^ x^l'^ and Hicf) ^ hicf). For p G T, 

E{p) ^ 1 + h{q) ^ h{Q) ^ h{x) - mK ^ /i(p) - ^(p), 

while by sieve methods (Theorem 4.2 of [24]), for large x 

logx -L-L V 9/ loga; 

i?(<?)^/i(g) 

It is also known that under certain conditions on the displacement law of the BRW (e.g. [|3l), the 
analog of B^ — hn converges in probability to a random variable as n — > oo. This is not known in 
our case. 

Conjecture 6. B^ — bn ^ X as n ^ oo for a random variable X with continuous distribution. 

If X exists, and the medians satisfy bn+i — bn e^^ as n — oo (plausible in light of Theorem 

([8])), it is easy to see that X = —1/e + mmi{zi + Xi), where {zi, Z2, . . .) has LPD distribution, 

Xi, X2, . . . are independent copies of X, and = means "has the same distribution as". This follows 
by conditioning on the positions of the first generation individuals (the points Zi); that is, using 

Bn = mini{zi + bI^I^), where bI^I^ are independent copies of -B„-i. The solutions X of this 
recursive distributional equation are not known, however. 

Unconditionally (whether X exists or not), we prove that Bn — bn has an exponentially decreas- 
ing left tail and doubly-exponentially decreasing right tail. Consequently, if Conjecture [6] holds, 
then all moments of X exist. 

sequence Xi,X2, ... of random variables is tight if for every e > there is a number M so that for all j, 
P{\Xj \ > M) ^ e. In other words, the distribution of Xj does not spread out as j — > 00. 
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Theorem 9. (a) For any ci < e, we have 

P{Bn -bn^ -x} e-^i" (n ^ 1, x ^ 0), 
and for any C2 > 2elog(2e) andr] > 0, 

P{Bn - bn ^ -x} ■>c2,v e"^'"" ^ 1, ^ X ^ (l/2e - r])n); 

(b)for any C3 < 1 there is a constant C4, depending on C3, so that 

P{Bn ^bn + x}^ exp (-e'=3(x-c4)j (n ^ 1, x ^ 0). 
Remark 3. By ( |6.2[ ), part (b) is nearly best possible; that is, the conclusion is false if C3 > 1. 

The next two lemmas hold for very general branching random walks. A notable feature is that 
they are local results, and tightness of Bn — bn can be proved without knowing anything about the 
growth of bn- We will use Theorem [8] to prove the stronger tail estimates. 

Lemma 6.1. For positive integers m, n and positive real numbers M, N, 

P{fi„+„ ^M + N}^ E[(P{5„ ^ iV})^-(^^)]. 

Proof. Suppose Bm+n ^ M + N and Zm{M) = k. For each of these k individuals, all of their 
descendants in generation m + n are offset from their generation m ancestor by at least N. □ 

Lemma 6.2. Let m, n be positive integers and let M > 0,e > be real. //"EKl — s)^rniM)j ^ 1^ 
then P{Bn ^ bn+m — M} ^ e. In particular, the conclusion holds ifP{Zm{M) < l/e} ^ |. 

Proof. Let q be the e-quantile of Bn, that is, P{-B„ ^ q} = By Lemma 6.1[ 

P{Brn+n ^ M + q} ^ E [(P{5„ ^ g})^-(*^)l 



1 

2 



Therefore, M + q ^ bm+n, and thus P{i?„ ^ bm+n — M} ^ P{Bn ^ q} = e. To prove the 
second part, assume that P{Zm(M) < l/e} ^ |. Then 

E {(1 - ^ P{Zm{m) <1} + {1- P{Zm{M) < i})(l - ey/' ^ 1 + 1 < 1. □ 

Lemma 6.3. For real t ^ 1 and integer k ^ 1, we have P{Zi(t) ^ k} ^ {et/k)^~^. 
Proof. The conclusion is trivial if A; ^ et, so we suppose k > et. Take s = | — 1. By ( |6.1| ), 

P{Zi(t) ^ A;} ^ P {(1 - t/i) ■ ■ ■ (1 - Uk-i) > e-*} 

^ e"* /" ■ ■ ■ /" [(1 - Ml) • ■ ■ (1 - Mfc_i)]'rfMi ■ ■ ■ duk^i = ^ □ 
Jo Jo {l + s)" 

Lemma 6.4. For all r ^ 1, 6 > 1 and e > 0, if x is large then P{Zr{x) ^ 6^} ^ exp{ — (^^ — sY}. 

Proof. When r = 1, this follows from Lemma [63] Assume it to be true for some r ^ 1, let 
and e be given, and assume without loss of generality that 6 — e > 1. The probability that 

Zr{x) ^ (6* — is ^ exp{ — (^^ — e/2)'^'} for large X. Now suppose ^^(x) =j< (9 — e/Sy and 
Zr+i{x) ^ 0^. Let nii be the number of children of the ?-th largest point in z'^'^^ which are offset at 
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most X from their parent. Let X be the set of indices with mj ^ lOOx. Note that mi + ■ • ■+mj ^ O'^. 



With j, mi, . . . , rrij fixed, by Lemma 6.3 the probability that Zr+i{x) ^ 6^ is at most 



JJP{Zi(x) ^ TTli} ^ JJe-2-« ^ g-2(e--100xi) ^ _ 

As mj ^ e^, the number of choices for j, mi, . . . , m^ is at most exp{(^ — For large x, 

P{Zr{x) ^ ^ exp{-(^ - e/2y} + exp{(0 - - ^'j ^ exp{-(0 - e)^'}. □ 

Proof of Theorem^ Let a > 1/e and < r] < ae/2. By [jTOl Theorem 2], for large r we have 
P{Zr(ar) ^ (ae — 77)^} ^ |. Let r be so large that, in addition, bn+r ^ bn + (1/e — ri)r for all 



n (r exists by Theorem [8]). Apply Lemma 6.2 with M = ar, m = r, e = {ae — r]) For large 
integers r, 

P{5„ ^ 6„ - (a - 1/e + r/)r} ^ P{5„ < - ar} ^ (ae - r^)-^ 

The first estimate follows with ci = ^ziy^Jip^- Fi^ l^t ^ — ^ 0, then let a — )• 1/e, so that ci — )■ e. 

For the second part of (a), take r] > and r as before (but fixed here), and let 5 = (1/e — r])r, so 
that bn+r ^ bn + S for all n. Since p{u) = 1 - logn for 1 ^ u ^ 2, P(^i(£:) ^ 1) = 1 - p(e^) = e 
when ^ £ ^ log 2. Considering the "leftmost child of the leftmost child of the . . .of the initial 
ancestor" in the branching random walk, we have P{Bkr ^ 5k/2} ^ P{Zi(5/2r) ^ 1}'^'" = 
{5/2rY^' for every k ^ 1. Hence, 



P{Bn ^ bn-kr + 6k/2} ^ P{fi„_fe, ^ 6„_fcjP{5fc,, ^ 5k/2} ^ 2 V 2r 
By assumption, h^-kr + 6k/2 ^ bn — 5k/2. Hence, for ^ A; ^ n/r we have 

P{Bn^bn-5k/2}^-(-\ . 

This gives the desired bound when ^ x ^ ^, with c[ = ^ log ^. 
To show part (b), we use induction on n to show that 

(6.3) P{Bn ^bn + x} ^2- <=Mc3(^-c5)} 

for n ^ 1 and x ^ 0, where C5 is sufficiently large. Theorem |9] (c) then follows with C4 = 
C5 — i5Si2S_2 ( |6.3| ) is trivial for ^ x ^ C5, we may assume x ^ C5. Let r, 5 be such that 

C3 ' 



bn+r — 6„ ^ 5 > for all n (the relationship between r and 5 is unimportant). Let A be a large 
integer, so that if R = Ar and A = AS, then 2e^^'^ ^ 1 — C3. Also suppose 03^^. For 
1 ^ n ^ R, ( |6.2| ) implies P{-Bn ^ &n + a;} ^ P{-B„ ^ x} = Pn(e^) ^ exp{— e^} if C5 is large 
enough. Suppose now that ( |6.3[ ) has been proved for 1 ^ n ^ m — 1, where m — 1 ^ i?. Define 
Xj = A + for j ^ 1 Let jo be the largest index j with ^ x + A. Let 2:1 ^ ^2 ^ • ■ ■ be 

the points in z*^^). For 1 ^ j ^ jo, let Pj be the event {zi > Aj (i < j), Zj ^ Xj}, and the Q be the 
event {zi > Aj (1 ^ i ^ jo)}- If -Pj' then the generation m points descending from each of the j 
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points . . . , are offset from their generation R ancestor by at least hm + x — \j. So 

jo 

P{S™ ^h^ + x}^ ^P[F,]P{5„_R ^ 6™ + X - XjY + p[g]. 

i=i 

Since 6^ ^ 6m-i? + the induction hypothesis implies that the sum on j is 

jo jo 
^ p|^p^.j2-jexp{c3(a;-C5+A-Aj)} ^ p |^p^.j2- cxp{c3(x-C5)+l} ^ 2^1-'=^P{'=3(2;-C5)} 

j=l j=l 

Now suppose that Q holds. By the assumption on A, 

30 oo ^ 

j^jo j=l j=l 

As Aj„ ^ X + A - (Ajo+i - AjJ ^ X, 

Let e = |(e — e'^^) and 6* = e — £, so that e"^^ < — £ < 6* < e. For some integer k ^ x, there are 
^ 9'^ points of z*^^^ m[k — l,k), for otherwise 



eV 1 



By Lemma |64| P[Q] ^ EfcP{^i?('^) > < Se^^''-")". This completes the proof of (b). □ 
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